Understanding the errors of SHAPE-directed RNA structure modeling.

نویسندگان

  • Wipapat Kladwang
  • Christopher C VanLang
  • Pablo Cordero
  • Rhiju Das
چکیده

Single-nucleotide-resolution chemical mapping for structured RNA is being rapidly advanced by new chemistries, faster readouts, and coupling to computational algorithms. Recent tests have shown that selective 2'-hydroxyl acylation by primer extension (SHAPE) can give near-zero error rates (0-2%) in modeling the helices of RNA secondary structure. Here, we benchmark the method using six molecules for which crystallographic data are available: tRNA(phe) and 5S rRNA from Escherichia coli, the P4-P6 domain of the Tetrahymena group I ribozyme, and ligand-bound domains from riboswitches for adenine, cyclic di-GMP, and glycine. SHAPE-directed modeling of these highly structured RNAs gave an overall false negative rate (FNR) of 17% and a false discovery rate (FDR) of 21%, with at least one helix prediction error in five of the six cases. Extensive variations of data processing, normalization, and modeling parameters did not significantly mitigate modeling errors. Only one varation, filtering out data collected with deoxyinosine triphosphate during primer extension, gave a modest improvement (FNR = 12%, and FDR = 14%). The residual structure modeling errors are explained by the insufficient information content of these RNAs' SHAPE data, as evaluated by a nonparametric bootstrapping analysis. Beyond these benchmark cases, bootstrapping suggests a low level of confidence (<50%) in the majority of helices in a previously proposed SHAPE-directed model for the HIV-1 RNA genome. Thus, SHAPE-directed RNA modeling is not always unambiguous, and helix-by-helix confidence estimates, as described herein, may be critical for interpreting results from this powerful methodology.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Statistical analysis of SHAPE-directed RNA secondary structure modeling.

The ability to predict RNA secondary structure is fundamental for understanding and manipulating RNA function. The information obtained from selective 2'-hydroxyl acylation analyzed by primer extension (SHAPE) experiments greatly improves the accuracy of RNA secondary structure prediction. Recently, Das and colleagues [Kladwang, W., et al. (2011) Biochemistry 50, 8049-8056] proposed a "bootstra...

متن کامل

Principles for understanding the accuracy of SHAPE-directed RNA structure modeling.

Accurate RNA structure modeling is an important, incompletely solved, challenge. Single-nucleotide resolution SHAPE (selective 2'-hydroxyl acylation analyzed by primer extension) yields an experimental measurement of local nucleotide flexibility that can be incorporated as pseudo-free energy change constraints to direct secondary structure predictions. Prior work from our laboratory has emphasi...

متن کامل

Relation Between RNA Sequences, Structures, and Shapes via Variation Networks

Background: RNA plays key role in many aspects of biological processes and its tertiary structure is critical for its biological function. RNA secondary structure represents various significant portions of RNA tertiary structure. Since the biological function of RNA is concluded indirectly from its primary structure, it would be important to analyze the relations between the RNA sequences and t...

متن کامل

Nonlinear Analysis of a Flexible Beam Actuated by a Couple of Active SMA Wire Actuators

There are two different ways of using SMA wires as actuators for shape control of flexible structures; which can be either embedded within the composite laminate or externally attached to the structure. Since the actuator can be placed at different offset distances from the beam, external actuators produce more bending moment and, consequently, considerable shape changes with the same magnitude...

متن کامل

In silico Analysis and Molecular Modeling of RNA Polymerase, Sigma S (RpoS) Protein in Pseudomonas aeruginosa PAO1

Background: Sigma factors are proteins that regulate transcription in bacteria. Sigma factors can be activated in response to different environmental conditions. The rpoS (RNA polymerase, sigma S) gene encodes sigma-38 (&sigma;38, or RpoS), a 37.8 kDa protein in Pseudomonas aeruginosa (P. aeruginosa) strains. RpoS is a central regulator of the general stress response and operates in both retroa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Biochemistry

دوره 50 37  شماره 

صفحات  -

تاریخ انتشار 2011